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Abstract 



We show that for any constant d, complex roots of degree d univariate rational (or 
Gaussian rational) polynomials — given by a list of coefhcients in binary — can be computed 
to a given accuracy by a uniform TC" algorithm (a uniform family of constant-depth 
polynomial-size threshold circuits). The basic idea is to compute the inverse function of 
, the polynomial by a power series. We also discuss an application to the theory VTC'^ of 

^ ■ bounded arithmetic. 

in" 

^ '. 1 Introduction 

■ The complexity class TC'' was originally defined by Hajnal et al. [15] in the nonuniform 

setting, as the class of problems recognizable by a family of polynomial-size constant-depth 
circuits with majority gates. It was implicitly studied before by Parberry and Schnitger [27], 
who consider various models of computation using thresholds (threshold circuits, Boltzmann 
^ I machines, threshold RAM, threshold Turing machines). The importance of the class follows 

I already from the work of Chandra, Stockmayer, and Vishkin [7], who show (in today's ter- 

minology) the TC'^-completeness of several basic problems (integer multiplication, iterated 
addition, sorting) under AC'' reductions. Barrington, Immerman, and Straubing [4] establish 
that there is a robust notion of fully uniform TC''. (We will use TC'' to denote this uniform 
TC'', unless stated otherwise.) 

We can regard TC'' as the natural complexity class of elementary arithmetical opera- 
tions: integer multiplication is TC'^-complete, whereas addition, subtraction, and ordering 
are in AC'^ C TC". The exact complexity of division took some time to settle. Wallace [33] 
constructed division circuits of depth 0((logn)^) and bounded fan-in (i.e., NC^). Reif [29] 
improved this bound to 0(logn log log n). Beame, Cook, and Hoover [5] proved that division, 
iterated multiplication, and exponentiation (with exponent given in unary) are TC°-reducible 
to each other, and constructed P-uniform TC'' circuits for these problems. Chin, Davida, and 
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Litow [8] exhibited logspace-uniform TC° circuits for division, showing in particular that di- 
vision is computable in L. Finally, Hesse, Allender, and Barrington [16] proved that division 
(and iterated multiplication) is in uniform TC*^. 

Using these results, other related problems can be shown to be computable in TC*^, for 
example polynomial division, iterated multiplication, and interpolation. In particular, using 
iterated addition and multiplication of rationals, it is possible to approximate in TC° functions 
presented by sufficiently nice power series, such as log, exp, .t^/^, and trigonometric functions, 
see e.g. Reif [29], Reif and Tate [30], Maciel and Therien [20], and Hesse et al. [16]. 

Numerical computation of roots of polynomials is one of the oldest problems in mathemat- 
ics, and countless algorithms have been devised to solve it, both sequential and parallel. The 
most popular methods are based on iterative techniques that successively derive closer and 
closer approximations to a root (or, sometimes, to all the roots simultaneously) starting from a 
suitable initial approximation. Apart from the prototypical Newton-Raphson iteration, there 
are for instance Laguerre's method [28, §9.5], Brent's method [28, §10.3], the Durand-Kcrncr 
method [13, 19], the Jenkins-Traub algorithm [17], and many others. One can also reduce 
root finding to matrix eigenvalue computation, for which there are iterative methods such 
as the QR algorithm [14]. Another class of root-finding algorithms are divide-and-conquer 
approaches: the basic idea is to recursively factorize the polynomial by identifying a suitable 
contour (typically, a circle) splitting the set of roots roughly in half, and recovering coeffi- 
cients of the factor whose roots fall inside the contour from the residue theorem by numerical 
integration. Algorithms of this kind include Pan [24], Ben-Or et al. [6], Neff [22], Neff and 
Reif [23], and Pan [25], see Pan [26] for an overview. These algorithms place root finding in 
NC: for example, the algorithm of [25] can find n-bit approximations to all roots of a poly- 
nomial of degree d < n in time 0((logn)^(log(i)^) using 0(nd^(loglogn)/(log(i)^) processors 
on an EREW PRAM. (More specifically, Allender [3] mentions that root finding is known to 
be in the #L hierarchy, but not known to be in GapL.) 

The purpose of this paper is to demonstrate that in the case of constant-degree polynomi- 
als, we can push the complexity of root finding down to uniform TC'^ (i-e., constant time on 
polynomially many processors on a TRAM, in terms of parallel complexity), as in the case 
of elementary arithmetical operations. (This is clearly optimal: already locating the unique 
root of a linear polynomial amounts to division, which is TC^'-hard.) As a corollary, the 
binary expansion of any algebraic constant can be computed in uniform TC" when given the 
bit position in unary. Our primary interest is theoretical, we seek to investigate the power of 
the complexity class TC°; we do not expect our algorithm to be competitive with established 
methods in practice, and we did not make any effort to optimize parameters of the algorithm. 

The basic idea of the algorithm is to express the inverse function of the polynomial by a 
power series, whose partial sums can be computed in TC° using the results of Hesse et al. [16]. 
We need to ensure that coefficients of the series are TC^-computable, we need bounds on the 
radius of convergence and convergence rate of the series, and we need to find a point in whose 
image to put the centre of the series so that the disk of convergence includes the origin. Doing 
the latter directly is in fact not much easier than approximating the root in the first place, 
so we instead construct a suitable polynomial-size set of sample points, and we invert the 
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polynomial at each one of them in parallel. 

We formulated our main result in terms of computational complexity, but our original 
motivation comes from logic (proof complexity). The bounded arithmetical theory VTC^ (see 
Cook and Nguyen [12]), whose provably total computable functions are the TC'^ functions, 
can define addition, multiplication, and ordering on binary integers, and it proves that these 
operations obey the basic identities making it a discretely ordered ring. The question is 
which other properties of the basic arithmetical operations are provable in the theory, and 
in particular, whether it can prove induction (on binary integers) for some class of formulas. 
Now, it follows easily from known algebraic characterizations of induction for open formulas in 
the language of ordered rings {lOpen, see Shepherdson [32]) and from the witnessing theorem 
for FTC^ that VTC'^ proves lOpen if and only if for each d there is a TC*^ root-finding 
algorithm for degree d polynomials whose soundness is provable in VTC^. Our result thus 
establishes the computational prerequisites for proving open induction in VTC^, leaving aside 
the problem of formalizing the algorithm in the theory. Since the soundness of the algorithm 
can be expressed as a universal sentence, we can also reformulate this result as follows: the 
theory FTC° + Thy^s (N) proves lOpen. 

The paper is organized as follows. In Section 2 we provide some background in the 
relevant parts of complexity theory and complex analysis. Section 3 contains material on 
inverting polynomials with power series. Section 4 presents our main result, a TC" root- 
finding algorithm. Finally, in Section 5 we discuss the connection to bounded arithmetic. 

2 Preliminaries 

A language L is in nonuniform TC° if there is a sequence of circuits C^: {0, 1}" — > {0, 1} 
consisting of unbounded fan-in majority and negation gates such that C„ computes the char- 
acteristic function of L on strings of length n, and C„ has size at most n"^ and depth c for 
some constant c. 

L is in (uniform) TC^ , if the sequence {C„ : n G w} is additionally DLOGTIME-uniform 
(J/D-uniform in the terminology of Ruzzo [31]): i.e., we can enumerate the gates in the circuit 
by numbers i < n^^^^ in such a way that one can check the type of gate i and whether gate i is 
an input of gate j by a deterministic Turing machine in time O(logn), given n,i,j in binary. 
There are other equivalent characterizations of TC''. For one, it coincides with languages 
recognizable by a threshold Turing machine [27] in time O(logn) with 0(1) thresholds [2]. 
Another important characterization is in terms of descriptive complexity. We can represent a 
string X G {0, 1}" by the first-order structure ({0, . . . , n — 1}, <, bit, X), where X is a unary 
predicate encoding the bits of x. Then a language is in TC° iff its corresponding class of 
structures is definable by a sentence of FOM (first-order logic with majority quantifiers). We 
refer the reader to [4] for more background on uniformity of TC''. 

In some cases it may be more convenient to consider languages in a non-binary alphabet 
S. The definition of TC'' can be adapted by adjusting the input alphabet of a threshold 
Turing machine, or by considering more predicates in the descriptive complexity setting. In 
the original definition using threshold circuits, the same can be accomplished by encoding 
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each symbol of S with a binary substring of fixed length. We can also define TC predicates 
with more than one input in the obvious way. 

A function /: {0, 1}* — ;> {0, 1}* is computable in TC*^ if the length of its output is polyno- 
mially bounded in the length of its input, and its bitgraph is a TC° predicate. (The bitgraph 
of / is a binary predicate b{x, i) which holds iff the zth bit of f{x) is 1.) In terms of the original 
definition, this amounts to allowing circuits C„: {0,1}" — >■ {0, 1}"**^"), where m(n) = -nP^^^ 
TC° functions are closed under composition, and under "parallel execution": if / is a TC 
function, its aggregate function g{{xo, ■ ■ ■ ,Xjn-i)) = {f{xo), . . . , f{xm-i)) is also in TC°. We 
note in this regard that TC° functions can do basic processing of lists 

•^0; -^Ij ■ ■ ■ ) 

where "," is a separator character. Using the fact that TC*^ can count commas (and other 
symbols), we can for instance extract the iih element from the list, convert the list to and 
from a representation where each element is padded to some fixed length with blanks, or sort 
the list according to a given TC° comparison predicate. 

We will refrain from presenting TC*^ functions in one of the formalisms suggested by the 
definitions above: we will give informal algorithms, generally consisting of a constant number 
of simple steps or TC^ building blocks, sometimes forking into polynomially many parallel 
threads. The reader should have no difficulty convincing herself that our algorithms are 
indeed in TC°. 

We will work with numbers of various kinds. Integers will be represented in binary as 
usual, unless stated otherwise. As we already mentioned in the introduction, elementary 
arithmetical operations on integers are TC" functions: this includes addition, subtraction, 
ordering, multiplication, division with remainder, exponentiation (with unary exponents), 
iterated addition, iterated multiplication, and square root approximation. Here, iterated 
addition is the function (xq, . . . ,Xm-i) i->- Yli<m^i^ similarly for multiplication. Notice 
that using iterated multiplication, we can also compute factorials and binomial or multinomial 
coefficients of unary arguments. Base conversion is also in TC*^. 

Rational numbers will be represented as pairs of integers, indicating fractions. We cannot 
assume fractions to be reduced, since integer gcd is not known to be TC°-computable. Using 
integer division, we can convert a fraction to its binary expansion with a given accuracy 
(the opposite conversion is trivial). Rational arithmetic is reducible to integer arithmetic 
in the obvious way, hence rational addition, subtraction, ordering, multiplication, division, 
exponentiation (with unary integer exponents), iterated addition, iterated multiplication, and 
square root approximation are in TC". 

In lieu of complex numbers, we will compute with Gaussian rationals (elements of the 
field Q{i)), represented as pairs of rationals a + ib. By reduction to rational arithmetic, we 
can see that addition, subtraction, complex conjugation, norm square, norm approximation, 
multiplication, division, and iterated addition of Gaussian rationals are in TC°. Using the 
binomial theorem, exponentiation with unary integer exponents is also in TC*^. (In fact, 
iterated multiplication of Gaussian rationals is in TC" using conversion to polar coordinates, 
but we will not need this.) 
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We will need some tools from complex analysis. We refer the reader to Ahlfors [1] or 
Conway [10] for background, however, we review here some basic facts to fix the notation. 
A function /: {/ — )■ C, where {/ C C is open, is holomorphic (or analytic) in U if /'(a) = 
limz^aif (z) ~ f{^))/{^ ~ C') exists for every a £ U. The set of all functions holomorphic in 
U is denoted H{U). Let B{a,r) := {z : \z — a\ < r} and B{a,r) := {z \z — a\ < r}. If / is 
holomorphic in the open disk B{a, R), it can be expressed by a power series 

oo 

fiz) = J2cniz-ar 

n=0 

on B(a,R). More generally, if / is holomorphic in the annulus A = B(a,R) \ B(a,r), 
< r < R < oo, it can be written in ^ as a Laurent series 

+ 00 

fiz)= Yl cniz-ar. 

n=—oo 

We denote the coefficients of the series by [{z — a)"]/ := c„. (Other variables may be used 
instead of z when convenient.) The residue of / at a is Res(/, a) := [{z — a)~^]f. When a = 0, 
we write just [z"]/ and Res(/), respectively. The coefficients of a Laurent series are given by 
Cauchy's integral formula: 

where 7 is any closed curve in A whose index with respect to a is 1 (such as the circle 
7(t) = a + ge'^^'^'', r < g < R). The identity theorem states that if f,g are holomorphic in a 
region (i.e., connected open set) U and coincide on a set X C U which has a limit point in 
U, then f = g. The open mapping theorem states that a nonconstant function / holomorphic 
in a region is an open mapping (i.e., maps open sets to open sets). 
If X C C and a G C, we put dist(a, X) = m{{\z - a\ : z E X}. 

We will also need some easy facts on zeros of polynomials. Let / € C[x] be a degree d 
polynomial, and write f{x) = Yl'j=Q^3^^ ■ Cauchy's hound [34, L. 6.2.7] states that every zero 
q; of / satisfies 

1 — I 1 — r < S 1 + max — 

an + max a, 3<d \aA 

o<j<d ■' 

Let /, 5 G (Q(i))[x] be two polynomials of degrees d,e (resp.), and assume /(a) = g{P) = 0, 
a^p. ltf,ge (Z[z])[x], we have 

(2«ll/M1Mlf 

where denotes the Lp-norm of the vector of coefficients of / [34, §6.8]. In general, we can 
apply (*) to the polynomials rf and sg, where r is the product of all denominators appearing 
among the coefficients of /, and similarly for s. If we represent / and g by the lists of their 
coefficients, which are in turn represented by quadruples of binary integers as detailed above, 
we obtain easily the following root separation bound: 
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Lemma 2.1 For each j = 0,1, let fj G (Q(i))[a;] have degree dj and total bit size nj, and 
assume fj{aj) = 0. If ao ^ ai, then 



3 Inverting polynomials 

As already mentioned in the introduction, the main strategy of our algorithm will be to 
approximate a power series computing the inverse function of the given polynomial /. In this 
section, we establish the properties of such series needed to make the algorithm work. 

The basic fact we rely on is that holomorphic functions with nonvanishing derivative are 
locally invertible: i.e., if / G H{U) and a £ U is such that /'(a) ^ 0, there exist open 
neighbourhoods a £ Uq C. U and /(a) € Vq such that / is a homeomorphism of Uq onto Vq, 
and the inverse function g = {f \ Uo)~^ is holomorphic in Vq. In particular, g is computable 
by a power series in a neighbourhood of /(a). 

Notice that local inverses of holomorphic functions are automatically two-sided: if / G 
H{U), g G H{V), a e U, b e V, g{b) = a, and f{g{z)) = z in a neighbourhood of b, then 
g{f {z)) = z in a neighbourhood of a. 

The coefficients of the power series of an inverse of a holomorphic function are given by 
the Lagrange inversion formula [9, §3.8, Thm. A]: 

Fact 3.1 Let f G H{U), g G H{V), fog = idy, a = g{b) eU,b = f{a) G n > 0. Then 

We can make the formula even more explicit as follows. First, the composition of two power 
series is given by Faa di Bruno's formula [9, §3.4, Thm. A], which we formulate only for 
a = 6 = for simplicity: 

Fact 3.2 Let f G H{U), g G H{V), g{Q) = G C/, /(O) = G n > 0. Then 




Note that here and below, the outer sum is finite, and the product has only finitely many 
terms different from 1, hence the right-hand side is well-defined without extra assumptions 
on convergence. We can now expand the residue in Fact 3.1 to obtain the following version 
of Lagrange inversion formula, which only refers to the coefficients of / [9, §3.8, Thm. E]: 

Proposition 3.3 Let f G H{U), g G H{V), fog = idy, a = g{b) eU,b = f{a) G V. Then 
[{w — b)^]g = a, and for n > 0, 
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Proof: Note that f'{a) ^ 0. Put fi{z) = f{a + z/f'{a)) - b and gi{w) = f'{a){g{b + w) - a), 
so that /i(0) = = gi{0), fiogi = id on a neighbourhood of 0, and f[{0) = 1. Write 
fi{z) = z{l — h{z)), where h is holomorphic in a neighbourhood of 0, and /i(0) = 0. Then 



m = -\z = -\z 



oo 



using Facts 3.1 and 3.2, and the expansion 

(1 - u;)-" = f+"7^). The result follows by 
noting that for any fc,n > 0, = ([(z - a)'=]/)/(/'(a))^ \w^\g^ = f{a)[iw - 6)"]^, and 

f{a) = [z-a]f. □ 

Let d be a constant. If / in Proposition 3.3 is a polynomial of degree d, then the product 
is nonzero only when rrij = for every j > d, hence it suffices to enumerate m2, . . . , ma- It 
follows easily that the outer sum has polynomially many (namely, 0{n'^)) terms, and we can 
compute [{w — b)"']g in uniform TC*^ given a, b, and the coefficients of / in binary, and n in 
unary. 

Apart from a description of the coefficients, we also need bounds on the radius of conver- 
gence of the inverse series, and on its rate of convergence (i.e., on the norm of its coefficients). 
Generally speaking, the radius of convergence of a power series is the distance to the nearest 
singularity. Since a polynomial / is an entire proper map, its inverse cannot escape to infinity 
or hit a point where / is undefined, thus the only singularities that can happen are branch 
points. These occur at zeros of /'. This suggests that the main parameter governing the 
radius of convergence and other properties of the inverse should be the distance of a to the 
set Cf = {z E C : f'{z) = 0} of critical points of /. 

Lemma 3.4 Let f G C[x] be a degree d polynomial with no roots in B{a,R), R> 0, and let 
/x>0. Then \ f{z)-f{a)\ < {{1 + fif - l)\f{a)\ for all z e B{a, fiR). 

Proof: Write f{z) = cYlj^-^^{z — aj). We have 

fjz) _ (z - g) + (g - aj) _ ^ JJ '^"" 



f(a) a — aj ^-^ -'■-'■g — a, 



hence 



/(a) 



1^0 jei ■' 1^0 jei 
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Proposition 3.5 Let f G C[x] have degree d> 1, f{a) = h, and < R< dist(a, Cf). Let 

oc 

g{w) = a + ^c„(u;-6)" 



n=l 



satisfy f o g = idg(b,e)' '^here g > is the radius of convergence of g. Put 

(5 In In 4 



= ^1 + ^rfi/ - 1 > ^ , = i^i?|/'(a)| 

/or < (5 < 1 (i/ie inequalities are established below). Then: 
(i) f is injective on B{a,iJ,R). 
{a) g> go. 

(Hi) g[B{b, g)] 3 B(a, XiR), and more generally, g[B(b, Sgo)] 3 B{a, XsR) for each 5 G (0, 1]. 
(iv) \cn\ < nR/ng^. 

Proof: Notice that — 1 > x for every x G M, hence ''~^/2 — 1 = exp((ln2)/(d — 1)) — 1 > 
(ln2)/((i -l);iy> (In 4 - l)/d immediately foUows. Similarly, > ln(l + (5(ln4 - l))/d. We 
have ln(l + (5(ln4 — 1)) > 5 In In 4 for 5 G [0, 1] as In is concave. 

(i) : Let u,v E B{a,fiR), u ^ v. We have 

fiv) - f{u) = £ f\z) dz = {v- u) (^f'{a) + /'((I - t)u + tv) - f\a) dt^ . 

Since |/'((1 - t)u + tv) - f'{a)\ < \ f'{a)\ for all t G (0, 1) by Lemma 3.4, we obtain 

[' f'{{l-t)u + tv)-f'{a)dt < r\f'{{l-t)u + tv)-f'{a)\dt<\f'{a)\, 
Jo Jo 

thus fiu) ^ f{v). 

(ii) : Let U = B{a, fiR). Since / is a biholomorphism of U and f[U], g > dist(6, C \ f[U]). 
Since f[U] is open, there exists w ^ f[U] such that |w — 6| = dist(6, C \ / [[/]). Let z„ G [/ be 
such that lim„ f{zn) = w. By compactness, {z^} has a convergent subsequence; without loss 
of generality, there exists z = lim„z„. Then f{z) = w by continuity, hence z ^U. However, 
z eU, hence z is in the topological boundary dU = U -^intU = C/ \ C/. We have thus verified 
that g>dist{b,f[dU]). 

Let = a + uRe^^ G dU. We have 

f{u) = ^ + /'(^) dz = b + Re'^ {^nf'{a) + f'{a + te'^ R) - f'{a) dt^ . 
By Lemma 3.4, \ f'{a + te'^R) - f'{a)\ < ((1 + t)"^-^ - l) |/'(a)|, hence 

£ f'{a + te^'R) - f\a) dt < \f{a)\ j\l + t)''-^ -ldt = \f'{a)\ (^ill^lnl - 
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Thus, 



|/(^) -6| >R\f'{a)\ LJ^^^ = i^R\na)\. 



(iii): The proof above shows that g[B{b, go)] C [/. As / is injectivc on U ^ B{a,XsR), it 
suffices to show that f[B{a, XsR)] C B(b, 6go)- Let thus u = a + Ai^e*^, A < A^. As above, 

f{u) = b + Re'^ (^A/'(a) + £ f'{a + te'^^R) - f'{a) d?j 



and 



/ f'{a + te'^R)-f'{a)dt 
Jo 



<\f'ia)\(^-^±^^-X], 



hence 

\fiu) - b\ <R\f'{a)\ ^^^f-^ <R\f{a)\ ^^^^f-^ = Sgo- 
(iv): Let 7(i) = a + jiRe^'"'^^. By Fact 3.1 and Cauchy's integral formula, 

If dz fJ-R e^^'^^dt 



27rm J^ {f{z) - by n Jo {f{j{t)) - by 
The proof of (ii) shows \ f{'j{t)) — 6| > go, hence 



^^ I^R f dt iiR 
n Jo \fm)-b\n- ng^- 



Example 3.6 Let f{z) = z'^, a = b = I. Then /' = dz'^'^, Cj = {0}, R = 1, f\a) = d. It 
is not hard to see that / is injective on 5(1, r) iff no two points of B{l,r) have arguments 
differing by 27r/d iff r < sin(7r/d) = ■K/d + 0{d~^). Since g must hit a root of /' at the circle 
of convergence, we must have g = 1 = {l/d)Rf'{a). Finally, |(1 + z^ — 1| is maximized 
on {z : \z\ = r} for z positive real, thus B{1,XR) C g[B{l,Sg)] iff (1 + A)'' - 1 < S iff 
A < (1 + 5y/'^ — 1 = ln(l + S)/d + 0{d'''^). Thus, in Proposition 3.5, /x, v, and A^ are optimal 
up to a linear factor. 

Remark 3.7 We prefer to give a simple direct proof of Proposition 3.5 for the benefit of 
the reader. Nevertheless, we could have assembled the bounds (with somewhat different 
constants) from several more sophisticated results in the literature. The Grace-Heawood 
theorem (or rather its corollary, originally due to Alexander, Kakeya, and Szego; see [21, 
Thm. 23,2]) states that (i) holds with = sin(7r/(i) (which is tight in view of the z'^ example). 
Then the Koebe 1/4-theorem [11, Thm. 14.7.8] implies (ii) with u = ji/A, and one more 
application of the theorem yields (iii) with A^ = vS/A. 
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4 Root finding in TC° 

We start with the core part of our root-finding algorithm. While it is conceptually simple, its 
output is rather crude, so we will have to combine it with some pre- and postprocessing to 
obtain the desired result (Theorem 4.5). 

Theorem 4.1 Let d he a constant. There exists a uniform TC" function which, given the 
coefficients of a degree d polynomial f G (Q(i))[x] in binary and t in unary, computes a list 
{zj : j < s} C Q(i) such that every complex root of f is within distance 2~* of some Zj. 

Proof: If d = 1, it suffices to divide the coefficients of /. Assume d >2. Let ^,u,\ = A1/2 be 
as in Proposition 3.5 (more precisely, we should use their fixed rational approximations; we 
will ignore this for simplicity). Let A=l + \/^, p = |"57r/A], and ^ = e^'^*/^ (approximately, 
again). Consider the TC° algorithm given by the following description: 



(ii 

(iii 
(iv 

(v 
(vi 
(vii 
(viii 



Input: / = Y.j<d fj^^ ■^ith fj G Q(i), fd / 0, and t>Om unary. 

Put e = 2~*. Compute recursively a list C = {aj : j < s} including e/4-approximations 
of all roots of /'. 

Output (in parallel) each aj. 

Put c = 2 + maxj<^d\fj/fd\ and fcmax = [log(2ce~^)/ log A] . 
For every j < s, k < /cmax, and q < p, do the following in parallel. 
Let a = aj+ eA'^^i, b = f{a), R=\\a- aj\, N = [log2(M-R£"^)l . 
For each h < d, let A = ILLh O!)/""""'^- 
Compute and output 

(2m2 + • • • + dmd)\ {-h)""^ ■ ■ ■ (-/d)™d(_5)i+m2+-+(d-i)md 



m2,...,ma m2\---md 

Let /(a) = 0, we have to show that one of the numbers output by the algorithm is e-close 
to a. If |a — ajl < s for some j, we are done by step (iii). We can thus assume dist(a, C) > e, 
which implies dist(a, C/) > 3£/4. Assume that aj is an e/4-approximation of the root aj of 
/' nearest to a. Since all roots of / or /' have modulus bounded by c — 1 by Cauchy's bound, 
we have e < \a — aj] < 2c, thus there exists k < k^aax such that eA^ < \a — aj] < eA'^'^^. Let 
q < p he such that the argument of a — aj differs from 2TTq/p by at most ir/p, and consider 
steps (v)-(viii) for this particular choice of j,k,q (cf. Fig. 1). We have 

vr ^ 1 \ , , . 2A 

p^ ~A^ 



(1 + m2 + • • • + (d - l)md)\ fl+^rn,+-+dm. 



\oc — a\ < I — hi 7 1 \a — aj\ < — \a — aj\ < - \a — aj\. 



Notice that 

dist(a, C^) = \a — a.j\ > \a — aj' 
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Figure 1: The spiderweb. 



by the choice of aj and aj , hence 

dist(a, Cf) > dist(a, Cf) — \a — a\ > \a — aj\ — - — -\a — aj\ > -\(x — aj\ > R. 



Since 



we also have 



Let 



a — ajl > \a — ajl — \a — a\ > —\a — a 



I 5 2A , , , „ 

\a — a\ < — —— a — a,- = Ait. 
I '45 ^ 



n=l 



be an inverse of / in a neighbourhood of 6, and let q be its radius of convergence. By 
Proposition 3.5, |— 6| = |/(a) — 6| < £'o/2, where = iyR\f'{a)\ < g. Thus, g{f{a)) = g{Q) = 
a. Since f^z^ = f{z + a) by the binomial formula, fh = [{z — a)^]f. Then it follows from 
Proposition 3.3 that 



N 



Zj,k,q = a + J2cn{-bT. 



n=l 



Since 



2" 



by Proposition 3.5, we have 

|o ~ ^j,k,q\ 



n=N+l 



□ 
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Most of the algorithm described in Theorem 4.1 is independent of the assumption of d being 
constant (or it can be worked around). There are two principal exceptions. First, the recursion 
in step (ii) amounts to d sequential invocations of the algorithm. Second, while N is still linear 
in the size of the input, the main sum in step (viii) has roughly iV"^ terms. Thus, approximation 
of roots of arbitrary univariate polynomials can be done by (uniform) threshold circuits of 
depth 0{d) and size n'^^'^^ , where n is the total length of the input. (The known NC algorithms 
for root finding can do much better for large d.) 

The algorithm from Theorem 4.1 does the hard work in locating the roots of /, but it 
suffers from several drawbacks: 

• Its output includes a lot of bogus results that are not actually close to any root of /. 

• There may be many elements on the list close to the same root, and we do not get any 
information on the multiplicity of the roots. 

• The roots have no "identity": if we run the algorithm for two different is, we do not 
know which approximate roots on the output lists correspond to each other. 

• It may be desirable to output the binary expansions of the roots rather than just ap- 
proximations. 

We are going to polish the output of the algorithm to fix these problems. Let us first formulate 
precisely the goal. 

Definition 4.2 The t-digit binary expansion of a G C is the pair ([Re(a2*)J , [Im(a2*)J), 
where both integers arc written in binary. A root-finding algorithm for a set of polynomials 
P C (Q(i))[x] is an algorithm with the following properties: 

(i) The input consists of a polynomial f & P given by a list of its coefficients in binary, 
and a positive integer t in unary. 

(n) The output is a list of pairs {{zj{f,t),ej{f,t)) : j < s{f,t)}. 

(iii) For every f & P, there exists a factorization 

j<s 

where c € Q{i), aj G C, aj ^ for j ^ k, and Cj > 0, such that for every t: s{f, t) = s, 
ej{f,t) = Cj, and Zj{f,t) is the t-digit binary expansion of aj. 

We note that the choice of base 2 in the output is arbitrary, the algorithm can output expan- 
sions in any other base if needed. 

Lemma 4.3 Let d be a constant. Given a degree d polynomial f G {Q{i))[x], we can compute 
in uniform TC^ a list of pairwise coprime square-free nonconstant polynomials fj, c G Q{i), 
and integers Sj > such that f = cY[j^k fj^> where k, ej < d. 
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Proof: Since d is constant, division of degree d polynomials takes 0(1) arithmetical oper- 
ations, hence it can be implemented in uniform TC°. The same holds for gcd, using the 
Euclidean algorithm. We compute a list L = {fj : j < k), k < d, of nonconstant polynomials 
such that / = Y\j fj as follows: 

(i) Start with L = (/). Repeat the following steps until none of them is applicable. 

(ii) If fj is not square-free, replace it with gcd(/j, /j) and fj/ gcd(/j, /j). 

(iii) If fh I fj, fj \ fh for some h,j, replace fj in L with fh, fj/fh- 

(iv) If g := gcd{fh, fj) / 1 for some h,j such that fh \ fj, fj \ fh, replace fh, fj in L with 

g,g,fh/g,fj/g- 

The algorithm terminates after at most d steps, hence it is in TC°. Clearly, it computes a 
list of square- free polynomials such that for every h,j, fh is coprime to fj or fh is a scalar 
multiple of fj. It remains to collect scalar multiples of the same polynomial together. □ 

Lemma 4.4 Let d be a constant. Given a degree d square-free polynomial f G (Q(z))[x] and 
t in unary, we can compute in uniform TC° a list {zj : j < s} such that every root of f is 
within distance 2~* of some Zj , and every Zj is within distance 2"* of some root. 

Proof: We use the notation from the proof of Theorem 4.1. We modify the algorithm from 
that proof as follows: 

• We compute an £o > such that the distance of any root of / to any root of /' is at 
least £o using Lemma 2.1. In step (ii), we put e = min(2~*, £o/3). 

• We skip step (iii). 

• In step (vi), we check that \b\ < ^i'\f'{a)\R and \a — aj'\ > R + e/4 for every / < s. If 
either condition is violated, we output a symbol "*" instead of a number, and skip the 
remaining two steps. 

The result is a list of numbers and *'s; it is easy to construct the sublist consisting of only 
numbers by a TC° function. 

Let Zj^k,q be one of the numbers output by the algorithm. In step (vi) we ensured 
dist(a, C) > i? + e/4, hence dist(a, Cj) > R. Moreover, |0 — 6| < Qo/2, hence is within 
the radius of convergence of g, and a = ^(O) is a root of / whose distance from Zj^k,q is 



fiR ^ 



\a-Zj,k,q\= Yl 

n=N+l 

On the other hand, let a be a root of /. Since dist(Q!,Cj) > we have dist(Q!,C) > e, 
hence wc can choose j, k, q such that \a — Zj^k^q\ < e as in the proof of Theorem 4.1. We have 
to show that the extra conditions in step (vi) are satisfied. |6| < ^i'R\f'{a)\ was verified in 
the proof of Theorem 4.1. Moreover, 

a — ai'l > a — a,/ — \a — a\ — - > a — cto — a — a — - > — a — a,- > R + — 

I J I - I jii 14-1 -^11 '4~5 -"2" 4 

as \a — aj\ > £0 — e/4 > |£. □ 
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We can now finish the proof of the main result of this paper: 

Theorem 4.5 For every constant d, there exists a uniform TC^ root-finding algorithm for 
degree d polynomials in the sense of Definition 4.2. 

Proof: We employ the notation of Definition 4.2. By Lemma 4.3, we can assume / to be 
square- free (in which case we will have ej{f,t) = 1 for all j, so we only need to compute the 
roots). Consider the following TC° algorithm: 

(i) Using Lemma 2.1, compute an 77 > such that all roots of / are at distance at least rj 
from each other. 

(ii) Using Lemma 4.4, compute a list {r'j : j < u} such that every root of / is within 
distance 77/6 of some r'j, and vice versa. 

(iii) Note that if r'f^ and r'j correspond to the same root, then ~ ''^jl < f^' otherwise 

— r'jl > |ry. Use this criterion to omit duplicate roots from the list, creating a list 
{rj : j < d} which contains ?7/5-approximations of all roots of /, each of them exactly 
once. 

(iv) If e := 2~* > ri/5, output Zj := rj and halt. Otherwise use Lemma 4.4 to construct a 
list {z'l^ : h < s} consisting of e-approximations of roots of /. 

(v) For each j < d, output zj := z'^jy where h{j) is the smallest h < s such that — rj| < 

Notice that the computation of Vj is independent of t. Let aj be the unique root of / such that 
\aj — rj\ < r//5. Given t and i, let j' be such that \zf^ — Oji \ < e. Then \z'^ — rj\ < e + rj/h < ^rj 
if j = i'j otherwise jz^ — rj\ > ^rj — £ > ^rj. Thus, the definition of h{j) in the last step is 
sound, and guarantees \zj — aj\ < s. 

It follows that this TC° function has all the required properties, except that it computes 
approximations instead of binary expansions. We can fix this as follows. Using the algorithm 
we have just described, we can compute integers u,v such that \u + iv — 2*aj\ < 1. Then 
[Re(2*aj)J is either u ot u — 1, hence it remains to find the sign of Re(2*aj) — u (the case of 
Im is similar). 

Let g{z) = /(2-*(2z + n)), h{z) = g{-z), and a = \{2^aj - u). Then g{a) = = h{-a) 
and a— {—a) = Re(2*aj)— u. Using Lemma 2.1, we can compute ^ > such that |q— (— q)| > ^ 
whenever it is nonzero. Using the algorithm above, we can compute rational u' ,v' such that 
\u' + iv' - 2*aj| < i/A. li\u-u'\ < ^/2, then Re(2*aj) = u. Otherwise, |Re(2*aj) - n| > ^, 
hence the sign oi u' — u agrees with the sign of Re(2*aj) — u. □ 

Corollary 4.6 If a is a fixed real algebraic number, then the kth bit of a can be computed in 
uniform TC^, given k in unary. □ 

(Note that this corollary is only interesting in the uniform setting, since the language is 
unary.) 
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5 Open induction in VTC 



As we already mentioned in the introduction, our primary motivation for studying root finding 
for constant-degree polynomials comes from bounded arithmetic. We will now describe the 
connection in more detail. A reader not interested in bounded arithmetic may safely stop 
reading here. 

The basic objects of study in bounded arithmetic are weak first-order theories based on 
integer arithmetic. There is a loose correspondence of arithmetical theories to complexity 
classes: in particular, if a theory T corresponds to a class C, then the provable total com- 
putable functions of T are functions from C (or more precisely, FC). The following is one of 
the natural problems to study in this context: assume we have a concept (say, a language or 
a function) from the computational class C. Which properties of this concept are provable 
in the theory T? (This asks for a form of feasible reasoning: what can we show about the 
concept when we are restricted to tools not exceeding its complexity?) 

Here we are concerned with the theory VTC^, corresponding to TC^. We refer the reader 
to Cook and Nguyen [12] for a comprehensive treatment of VTC^. Let us briefly recall that 
VTC^ is a two-sorted theory, with one sort intended for natural numbers (which we think of 
as given in unary), and one sort for finite sets of these unary numbers (which we also regard 
as finite binary strings, or as numbers written in binary). We are primarily interested in 
the binary number sort, we consider the unary sort to be auxiliary. We use capital letters 
X,Y, . . . for variables of the binary (set) sort, and lowercase letters x,y, . . . for the unary 
sort. The language of the theory consists of basic arithmetical operations on the unary sort, 
the elementhood (or bit) predicate x e X, and a function \X\ which extracts an upper 
bound on elements of a set X. The axioms of VTC^ include comprehension for T,q formulas 
(formulas with number quantifiers bounded by a term and no set quantifiers) — which also 
implies induction on unary numbers for formulas — and an axiom ensuring the existence 
of counting functions for any set. The provably total computable (i.e., Sj-definable: T,\ 
formulas consist of a block of existential set quantifiers in front of a T,q formula) functions of 
VTC^ are the TC° functions. 

In VTC^, we can define the basic arithmetical operations -|-, •, < on binary integers. Our 
main question is, what properties of these operations are provable in VTC^. (We can make 
this more precise as follows: which theories in the usual single-sorted language of arithmetic 
LpA = (0, 1, +, •, <) are interpreted in VTC^ by the corresponding operations on the binary 
sort?) It is not hard to show that VTC^ proves binary integers to form a discretely ordered 
ring (DOR). What we would especially like to know is whether VTC^ can prove the induction 
schema on the binary sort 

ip{0) A VX {ip{X) ip{X + 1)) ^ VX ip{X) 

for some nontrivial class of formulas (p. In particular, we want to know whether VTC^ 
includes the theory lOpen (axiomatizcd by induction for open formulas of LpA over DOR) 
introduced by Shepherdson [32] and widely studied in the literature. 
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Now, assume for a moment that VTC h lOpen. Then for each constant d, VTC proves 

X <Y A F{X) < < F{Y) ^3Z{X < Z <Y A F{Z) < < F{Z + 1)) 

where F{X) = ^j^d UjX^ is a degree d integer polynomial whose coefficients are parameters 
of the formula. This is (equivalent to) a T,\ formula, hence the existential quantifier is, 
provably in VTC^, witnessed by a TC" function G{Uo, . . . ,11^, X,Y). Since any rational 
polynomial is a scalar multiple of an integer polynomial, and we can pass from a polynomial 
F{X) to 2*'^F{2-*X) to reduce the error from 1 to 2 *, we see that there is a TC° algorithm 
solving the following root-finding problem: given a degree d rational polynomial and two 
rational bounds where it assumes opposite signs, approximate a real root of the polynomial 
between the two bounds up to a given accuracy. Using a slightly more complicated argument, 
one can also obtain a root-finding algorithm in the set-up we considered earlier : i.e., we 
approximate all complex roots of the polynomial, and the input of the algorithm is only the 
polynomial and the desired error of approximation. Thus, a TC° root-finding algorithm is a 
necessary prerequisite for showing lOpen in VTC^. 

We can in a sense reverse the argument above to obtain a proof of open induction from 
a root-finding algorithm, but there is an important caveat. The way we used the witnessing 
theorem for VTC^, we lost the information that the soundness of the algorithm is provable 
m VTC^. Indeed, if we are only concerned with the computational complexity of witnessing 
functions, then witnessing of S]; formulas is unaffected by addition of true universal (i.e., 
VS^) axioms to the theory. In other words, the same argument shows the existence of a root- 
finding algorithm from the weaker assumption VTC^ + Th.^^B(N) h lOpen, where Thyjns(N) 
denotes the set of all VS^ sentences true in the standard model of arithmetic. Now, this 
formulation of the argument can be reversed: 

Theorem 5.1 The theory VTC° + T\^b{N) proves lOpen for the binary number sort. 

Proof: Let M be a model of VTC^ + Thy^s (N), and D be the discretely ordered ring of the 
binary integers of M. For any constant d, we can use Theorem 4.1 to construct a TC° function 
which, given the coefficients of an integer polynomial of degree d, computes a list of integers 
ao < ai < ■ • • < ak, k < d, such that the sign of the polynomial is constant on each of the 
integer intervals (aj,aj_|-i), (— oo,ao), {ak,+oo). This property of the function is expressible 
by a VS^ sentence (when the coefficients of the polynomial and the Uj are taken from the 
binary sort), hence it holds in D that such elements oq, . . . ,0^ exist for every polynomial over 
D. 

Any atomic formula ip{x) of LpA with parameters from D is equivalent in DOR to the 

formula f{x) < for some / G D[x], hence ip{D) := {x : D |= '^(x)} is a finite union of 
intervals. Sets of this kind form a Boolean algebra, hence (p{D) is a finite union of intervals 
for every open formula (p. This implies induction for ip: if D \= <f{0) A ^(p{u) for some u > 0, 
the interval I of (p{D) containing cannot be infinite from above, hence its larger end-point 
V e D satisfies D \= ip{v) A ^ip{v + 1). □ 

Problem 5.2 Does VTC^ prove lOpen ? 
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In light of the discussion above, Problem 5.2 is essentially equivalent to the following: are there 
TC^ root-finding algorithms for constant-degree polynomials whose correctness is provable in 
VTC^l Wc remark that the complex-analytic tools we used in the proof of Theorem 4.1 are 
not available in VTC^. 

We note that already proving the totality of integer division in VTC^ (i.e., formalization 
of a TC° integer division algorithm in VTC^) is a nontrivial open^ problem, thus Problem 5.2 
may turn out to be too ambitious a goal. The following is a still interesting version of the 
question, which may be easier to settle: 

Problem 5.3 Does VTC° + IMUL prove lOpen, where IMUL is a natural axiom postulating 
the totality of iterated integer multiplication ? 

We also mention that it is not hard to prove in VTC^ that binary integers form a Z-ring, 
which implies all universal consequences of lOpen in the language of ordered rings. The 
problem is thus only with statements with a genuinely existential import (note that lOpen is 
a V3 theory) . 
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